Predicting

TESTING

           Truth
Prediction  Cory Wong Fearless Katzman Nate Vulfpeck Woody
  Cory Wong        31        3       1    6       20     0
  Fearless          0       15       0    3        4     0
  Katzman           1        0      20    1        8     7
  Nate              0        0       1   15        5     0
  Vulfpeck         16        3       5    3       30     3
  Woody             0        1       4    1        7    14

When constructing the KNN network one thing that drastically improved performance is using only the top 10 most influential features. Still there are some interesting observations to be made. Although this confusion matrix gives some insight, it is skewed towards the bigger categories like Vulfpeck and Cory wong, and it only shows numbers, and not a clear measure of performance.

Detailed analysis of performance

# A tibble: 6 x 3
  class     precision recall
  <fct>         <dbl>  <dbl>
1 Cory Wong     0.636  0.729
2 Fearless      0.778  0.636
3 Katzman       0.594  0.613
4 Nate          0.875  0.483
5 Vulfpeck      0.565  0.703
6 Woody         0.467  0.292

As with the KNN network we see in this Random Forest implementation the Fearless Flyers playlist is the most distinctive group in the corpus.

Importance of features


This shows the most important distinguishing features between the different playlists. Loudness seems to be the key distinguishing feature, followed by instrumentalness and energy. The rest of the top 10 is mainly timbre components, while the key of the song doesn’t seem to contribute all that much to the predictions.

Clustering


Here we try clustering in the 6 categories we defined at the beginning of this portfolio. As you can see the groups are certainly not pure. Nate smith seems to concentrate in the upper right, but is also in the lower clusters. I will analyse the exact classifications more precisely in the final portfolio.

Principal Components


In the previous slide we were shown the 6 clusters our random forest came up with. This is represented in our graph along two dimensions. In machine learning Principal Component Analysis is used to determine a linear combination of the features in our dataset that offer the highest rate of variability. In this circle we see the same two dimensions, but now plotted on top is the contribution of each of our features, the ones we know how to interpret. We can see that acousticness and energy are the biggest contributours for dimension 1, while c01 plays the largest role in dimension two.

Tempo

TESTING

Nate Smith’s Get Down


A tempogram of Nate Smith’s shows his impeccable timing and ability to play with tempo.

Nate Smith playing with time


Nate Smith’s incredible drumming

Nate Smith is a legendary drummer, and one of his strengths is his impeccable timing. As you can see in these plots the tempo of the song is straight as an arrow. As for Big/small Five, the song is in 5/4, but one of Nate Smith’s signature moves is playing a 4 beat rythm over 5/4 and keeping the timing perfect. Being used to western music you expect the song to change in an even number of measures, but because the song is in 5/4 and the change in the song happens after 7 measures you are completely blindsided. Not only does it not line up with the 4/4, but even in 5/4 the change seems one measure early.

You can also clearly see spotify struggles with his playing different feeling rythms over the same tempo. In the end of “paved” you can clearly see he slows down as a means to end the song.

Adrienne & Adrianne


Vulfpeck and Fearless flyer’s adrienne

There seem to be two rythms in this song. This is most likely caused by the clear 4/4 feel of the drums in comparison with the offset melody. The melody of the chorus starts on the off-beat of the first beat of the bar of the drums. The melody covers 2 and a half bars, resulting in a sort of 5/8 feel to the song. Due to the fact that the chorus ends on a rest that goes over the first beat of the verse this doesn’t make either tempo stand out as more likely.

Chords and Keys

Theo Katzman’s chords- You could be president by Theo Katzman


Here we see the chordogram for “You could be president” by Theo Katzman. A typical song in his style. It has a halftime beat and vocals, a repeating guitar riff as a melody and a more uptempo chorus. It also has a typical piano and guitar solo at 135 seconds. We can see clearly he sticks to the E flat 7 chord that the song is built on througout the entire song.

Cory Wong Chordogram - Golden


Cory wong’s golden is very uptempo song, which is not something very typical for him. The song also features a vocal performance that is similar to a pop song. Coincidentally this song also sticks to E flat 7 as a main chord, but you can see this is not the only chord used. This is probably because of the harmonies between different instruments in this song.

Chromatic and Cepstral Analysis

Cepstograms of Cory Wong by Vulfpeck (featuring Cory Wong)


We learn more about the song every time we increase the resolution of the graph. You can clearly see the end of the song has high values for the third cepstral component, which corresponds to the outro which features a live audience cheering.

Another striking part is the difference around the 100 second mark, before it the song has high values for cepstral features 2-4, while afterwards component 1 is the most prominent. The one thing that is clearly missing in the second part are high pitches on guitar. The bass and drums stop around the 90 second mark, but later return although less prominently.

In the first part we also see interesting patterns arise in the more detailed resolutions. Closer investigation will be done to explain these.

Here we see the Self Similarity Matrix for “Too hot in LA”


When analyzing the structure of the self similarity matrix for timbre(left) and pitch(right) there are some clear structures. The song has a clear pre-chorus and chorus. What is interesting is that the most prominent yellow pitch lines are only for the prechorus, meaning the chorus and verse are very similar in pitch. The main melody is provided by the bass, and indeed the bass part is the same for the chorus and verses, and different for the prechorus.

Introduction

This portfolio explores the albums and singles of Vulfpeck and their members and closely associated acts Theo Katzmann and Woody Goss and collaborative acts Joey Dosik and Cory Wong. Vulfpeck’s Joe Dart is praised as one of the best bassists to emerge in recent years, but that’s not the only reason for this choice. Firstly, Vulfpeck is known for their one-take style of recording and off-beat musical style. Secondly, the band has a few closely associated members that appear on a large section of their music. Finally, the band’s members have their own solo projects which often still feature each other. This allows us to compare the albums of these artists and see what makes their music unique.

The natural groups are the main artist on each album. Though they share a band there are some clear differences in genre, Vulfpeck is primarily funk, Theo Katzman is more slow love songs, Cory Wong has the danceability of pop music.

The albums and singles are a good representation of the artists. They all use Spotify as their primary way of distribution and have at least an album each.

Typical tracks for Vulfpeck are Cory Wong (yes that song is named after another artist in the list, it’s going to be confusing data!) or It Gets Funkier, the latter having the typical funk structure and interaction between instruments. An atypical song would be Theo Katzman’s As the Romans do. Whereas his solo music often has a halftime feel soulful style, this is an up-tempo track, more like what I would personally expect to come from Cory Wong’s repertoire.

It will be interesting to find out what makes each artists music truly theirs, even though they are so closely related and collaborative.

Data Exploration

Column

Valence by artistS

To my surprise, Cory Wong was shown to be the most ‘negative’ out of all four artists.

Tempo vs Dancability


To see where this difference is coming from I looked at some more low-level features such as tempo and mode. This plot gives a good interpretation of some of the differences between the artists.

It Gets Funkier

It Gets Funkier


Description

It Gets Funkier II

It Gets Funkier III

It Gets Funkier IV

Frame 5


Description

Here we compare the different versions of It Gets Funkier.

Despite their seeming similarity, I could not get the dynamic time-warping to show that result clearly.